A numerical differentiation library exploiting parallel architectures
نویسندگان
چکیده
Article history: Received 7 July 2008 Received in revised form 15 January 2009 Accepted 10 February 2009 Available online xxxx PACS: 02.60.Jh 02.60.Pn 02.70.Bf
منابع مشابه
Ultra-Low-Energy DSP Processor Design for Many-Core Parallel Applications
Background and Objectives: Digital signal processors are widely used in energy constrained applications in which battery lifetime is a critical concern. Accordingly, designing ultra-low-energy processors is a major concern. In this work and in the first step, we propose a sub-threshold DSP processor. Methods: As our baseline architecture, we use a modified version of an existing ultra-low-power...
متن کاملA Parallel Numerical Library for UPC
Unified Parallel C (UPC) is a Partitioned Global Address Space (PGAS) language that exhibits high performance and portability on a broad class of shared and distributed memory parallel architectures. This paper describes the design and implementation of a parallel numerical library for UPC built on top of the sequential BLAS routines. The developed library exploits the particularities of the PG...
متن کاملA Portable Subroutine Library for Solving Linear Control Problems on Distributed Memory Computers
This paper describes the design of a software library for solving the basic computational problems that arise in analysis and synthesis of linear control systems. The library is intended for use in high performance computing environments based on parallel distributed memory architectures. The portability of the library is ensured by using the BLACS, PBLAS, and ScaLAPACK as the basic layer of co...
متن کاملTACO-Exploiting Cluster Networks for High-Level Collective Operations
TACO (Topologies and Collections) is a template library that introduces the flavour of distributed data parallel processing by means of reusable topology classes and C++ templates. This paper introduces TACO’s basic abstractions and provides a performance analysis for basic collective operations on various cluster architectures with several different networks.
متن کاملPerformance Characteristics of a Cosmology Package on Leading HPC Architectures
The Cosmic Microwave Background (CMB) is a snapshot of the Universe some 400,000 years after the Big Bang. The pattern of anisotropies in the CMB carries a wealth of information about the fundamental parameters of cosmology. Extracting this information is an extremely computationally expensive endeavor, requiring massively parallel computers and software packages capable of exploiting them. One...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computer Physics Communications
دوره 180 شماره
صفحات -
تاریخ انتشار 2009